competition format
PD$^3$: A Project Duplication Detection Framework via Adapted Multi-Agent Debate
Bao, Dezheng, Yang, Yueci, Chen, Xin, Jiang, Zhengxuan, Fei, Zeguo, Zhang, Daoze, Huang, Xuanwen, Chen, Junru, Yu, Chutian, Yuan, Xiang, Yang, Yang
Project duplication detection is critical for project quality assessment, as it improves resource utilization efficiency by preventing investing in newly proposed project that have already been studied. It requires the ability to understand high-level semantics and generate constructive and valuable feedback. Existing detection methods rely on basic word- or sentence-level comparison or solely apply large language models, lacking valuable insights for experts and in-depth comprehension of project content and review criteria. To tackle this issue, we propose PD$^3$, a Project Duplication Detection framework via adapted multi-agent Debate. Inspired by real-world expert debates, it employs a fair competition format to guide multi-agent debate to retrieve relevant projects. For feedback, it incorporates both qualitative and quantitative analysis to improve its practicality. Over 800 real-world power project data spanning more than 20 specialized fields are used to evaluate the framework, demonstrating that our method outperforms existing approaches by 7.43% and 8.00% in two downstream tasks. Furthermore, we establish an online platform, Review Dingdang, to assist power experts, saving 5.73 million USD in initial detection on more than 100 newly proposed projects.
- North America > United States (0.14)
- Asia > China (0.05)
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Simulation leagues: Analysis of competition formats
Budden, David, Wang, Peter, Obst, Oliver, Prokopenko, Mikhail
The selection of an appropriate competition format is critical for both the success and credibility of any competition, both real and simulated. In this paper, the automated parallelism offered by the RoboCupSoccer 2D simulation league is leveraged to conduct a 28,000 game round-robin between the top 8 teams from RoboCup 2012 and 2013. A proposed new competition format is found to reduce variation from the resultant statistically significant team performance rankings by 75% and 67%, when compared to the actual competition results from RoboCup 2012 and 2013 respectively. These results are statistically validated by generating 10,000 random tournaments for each of the three considered formats and comparing the respective distributions of ranking discrepancy.
- Oceania > Australia (0.04)
- South America > Brazil (0.04)
- North America > United States > Virginia (0.04)
- Europe > Netherlands > North Brabant > Eindhoven (0.04)